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50 



SEQUENCE LISTING 



(1) GENERAL INFORMATION:

(i) APPLICANT: Stiles, Michael E.
Vederas, John C.
Van Belkum, Marius J.
Worobo, Randy W.
Worobo, Rodney J.
McCormick, John K.
McMullen, Lynn M.
Leisner, Jorgen J.
Poon, Alison
Franz, Charles M.A.P.

(ii) TITLE OF INVENTION: NOVEL BACTERIOCINS, TRANSPORT AND VECTOR SYSTEM AND METHOD OF USE THEREOF

(iii) NUMBER OF SEQUENCES: 52

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Fish & Richardson
(B) STREET: 2200 Sand Hill Road
(C) CITY: Menlo Park
(D) STATE: CA
(E) COUNTRY: USA
(F) ZIP: 94025

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Green, Grant D. 

(B) REGISTRATION NUMBER: 31,259 

(C) REFERENCE/DOCKET NUMBER: 07254/051001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-233-2126 

(B) TELEFAX: 415-854-0875 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 184 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(ix) FEATURE:
(A) NAME/KEY: RBS
(B) LOCATION: 24..23

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 38..184

(ix) FEATURE:
(A) NAME/KEY: sig_peptide
(B) LOCATION: 38..125

(ix) FEATURE:
(A) NAME/KEY: primer_bind
(B) LOCATION: 114..144



(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(D) OTHER INFORMATION: /label= Restrict.site



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
ATCTTGGTAT CACAAACTAA TTTGGAGGTT GGTATAT ATG AAA AAA CAA ATT TTA



1 5 



£ S S SI S SI SI S 2 S S S £ EE S 

™, ™a rrr TCT GCT GTA AAT TAT GGT AAT GGT GTT TCT TGC 
T S S G^ £ £ £ val A S „ Tyr S l y As„ Sly V.l S« Cys 

25 30 



ACT AAA ACA AAA TGT TCA GTT AAC TGG GGA CAA 
Ser Lys Thr Lys Cys Ser Val Asn Trp Gly Gln
40 45 



55 



103 



151 



184 



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
10 Me, L.s L,s cm He Le. L, CIV ^ ^ XX. V* Val C, * 

20 

" 0. - «, V. „, C ~ To - - «■ ~ "f 5 - " P 



u 

i3 Gin 

20S 



2S 



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4290 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(vii) IMMEDIATE SOURCE:
(B) CLONE: Leucocin A genes

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

rrBT CATATTTTGA CTACCAAAAA ATATGCGTGT TGCGTACTTC 

AATTTTTGCC CATGCTGCAT GATATTTTbA n . waTTTrTT T 

40 Taa ACA.TAATTCC TCTGACAAAG CTAGTTATAT TAATTTCTTT 

AATGTTGATA ATTTTTTTAA AGATAATTCC ^ TrrA r T AT 

n-r* RCTGCCTTCT TTAATTTTTA CTAATGGAAA CATGGACTAT 
CAAGAGTTAA ATATTTCTCA ACTGCCTTC 

45 AAACGATTAT CAATTTATAC AAT^ ~ 

GACGAATTAA TTTCAAAACA . = ^ ^ 

AAATGTTTCA AAAAAGAATA AATTATAT ^CTACGGG 
50 wt APTCATTACA AAACACGCCT GTCCTTAGCC AAACTA 

CTGCACTCGC TATGGTTTTA ACTCATTACA rrTGCG AATG 

rrairra cTGCTTTAGG CATTGTTAAA GCTGCGAAlu 
ACCTGGCCAA AACTGACATG GAAGGAACGA CTGCTTTA 



60 
120 
180 
240 
300 
360 
420 
480
CGCTAGACTT TGAAACCATG CCGATCCAGG CTGATTTGAG TTTATTCGAT AAAAAGGATT 540

TACCCTATCC TTTTATCGCC CATGTCATTA AAGACGGTAA ATACCCGCAT TATTATGTAG 600
TTTATGGGAT CAAAGGTGAT CAGCTATTAA TCGCTGATCC AGATAATACC GTTGGTAAAA 660
ATAAAATGAC AAAAGCGCAT TTTAATGAGG AGTGGACCGG TGTGTCCATT TTTATTGCGC 720
CCAATCCAAC CTACAAGCCA ACAAAGGATA AAAAGCGTTC CTTGACTTCT TTTATTCCAG 780
TGATTACGCG TCAAAAATTA TTAGTTATCA ATATTGTCAT TGCTGCCTTG TTGGTTACCC 840
TAGTGAGTAT TTTAGGATCA TATTATTTGC AAGGTATCAT TGATACCTAT ATCCCCGATA 900
ACATGAAAAA CACCCTAGGG ATTGTGTCAC TAGGGCTTAT TTTTGCGTAT GTTATCCAAC 960
AACTGCTCTC TTATGCCAGA GATTATTTAT TAATTGTCAT GGGGCAACGC CTCTCAATTG 1020
ATATTATTTT GTCTTATATC AAACACATTT TTGAACTGCC AATGTCTTTT TTCGCGACGC 1080
GTCGTACCGG TGAAATTGTG AGCCGTTTTA CGGACGCTAA TGCCATTATT GAAGCCCTGG 1140
CAAGCACGAT GTTATCTGTA TTTTTAGACT TAGGAATTTT GGTCATTGTT GGCACAGTGC 1200
TAGTGGTTCA AAATTCAACC TTGTTTCTGA TTTCTCTGAT TGCCATTCCG GCTTATGCCC 1260
TAGTGGTCTG GCTCTTTATG CGTCCTTTTT CAAAGATGAA TAATGACCAA ATGCAAGCAG 1320
GTTCGATGTT AAGTTCTTCC ATTATTGAAG ATATTAATGG CGTTGAGACG ATTAAAGCGC 1380
TGAATAGTGA AGAAACCGCC TATCATAAAA TTGATCATGA ATTTGTCACT TATTTAGAAA 1440
AATCATTTGT TTACGCTAAA ACAGAAGCCA CTCAAAATGC GATTAAAAGC CTCTTACAGC 1500
TCTCTTTAAA TGTCGTGATC TTATGGGTTG GCGCACAACT GGTCATGACC AATAAAATTA 1560
GTGTTGGTCA ACTGATCACT TACAATGCTT TATTAGGATT TTTTACAGAT CCCTTGCAAA 1620
ATATTATTAA TTTACAAACT AAGCTCCAAC AGGCCTCAGT CGCTAATAAT CGTTTGAACG 1680
AAGTTTATTT GGTTGATTCA GAATTTAAAG CTAGTCATCA AATGACAGAA AGCATTATGC 1740
CCAATAGCTC ATTAGTAGCC GATCATATCA CCTATAAATA CGGTTTTGGT GCGCCAGCAA 1800
TTGATGATGT TTCACTAACG ATTACAGCCG GTGAAAAAAT CGCTTTGGTT GGGATTAGTG 1860
GATCAGGTAA ATCAACTTTA GTTAAATTGC TGGTTAATTT CTTTCAACCA GAGTCAGGGA 1920
CAATTTCACT AGGACAAACA CCACTCGCCA ATCTTGATAA ACATGAGCTA AGAGCACACA 1980
TTAATTATTT ACCACAAGAA CCCTTTATAT TTTCCGGTTC AATTATGGAC AACCTGTTAT 2040
TGGGGGCTAA GCCAGGGACA ACCCAAGAAG ATATTATCAG GGCGGTAGAA ATTGCTGAAA 2100
TTAAAGATGA TATTGAAAAA ATGTCGCAAG GATTTGGCAC TGAACTCGCA GAAAGTGGCA 2160
ATATTTCGGG TGGTCAAAAA CAACGCATTG CTTTAGCTAG AGCCATTTTA GTCGATTCTC 2220
CGGTGCTGAT TTTAGATGAG TCAACCAGTA ATCTTGATGT TTTAACAGAA AAAAAGATTA 2280
TTGATAATCT CATGCAGTTA ACCGAAAAAA CCATTATCTT TGTAGCGCAC CGCTTAACCA 2340
TTTCACAGCG AGTAGATCGT ATTCTAACCA TGCAAAACGG CAAAATTATC GAAGATGGCA 2400
CGCATAATAC TCTGCTTAAT GCCGGTGGTT TCTACGCGTC ATTGTTTAAT CATTAAGGAG 2460
ACCTGATGTT TGATCCAAAA TACTTAGAAA GTGGCGAATT TTATCAACGI CGTTACCGCA 2520
ATTTTCCAAC TCTGATTATT GTGCCTATTT TTTTGTTAGT CGTGTTTATC ATTCTATTTA 2580
GCCTATTTGC TAAGCGTGAA ATTGTTGTCA AAGCAAGTGG CGAAATTATT CCAGCCAAAG 2640
TGCTATCAGA TATCCAATCA ACCAGTAACA ATGCCATCGA TAGTAACCAA TTAACTGAAA 2700
ATAAAGTGGT TAAAAAAGGC GATACCTTAG TGACCTTTAC CAGTGGTAAT GAAAAAATAT 2760
CGTCTCAATT ACTGACGCAA CAACTTAATA ATCTTAACGA CCGTCTAAAA AGTCTTGATA 2820
CCTATAAGCA GAGTATTGTT AACGGACGTA GCGAATTTGG TGGCACAGAT CAATTTGGTT 2880
ATGATAGTCT ATTCAACGGC TATATGGCGC AAGTTGATAC GTTGACGAGT GAATTTAATC 2940
AACAAAGTAG TGATAAACAA ACAGCTGATC AACAAGCTAA TCATCAAATT GACGTTTTAA 3000
AACAAGGTCA ATCTAAAAAC AATCAAGAAT TAGCTAATTA TCAAGCTATT CTAACCAGTA 3060
TTAATAGCAA CACTAAACCG ACTAATAATC CCTATCAAGC CATTTATGAT AATTATTCAG 3120
CCCAGTTAAA ATCAGCACAA ACAACTGATG ATAAAGATCA AGTCAAGCAA ACTGCCTTAA 3180
GTAATGTACA ACAAGAAATT GATCAATTAC AAACAACGAG TAGTTCGTAT GATAGTCAAA 3240
TTGCTGGTAT TACAAAGAGT GGTCCTTTAT CTCAAAGCAG TACCTTAGAT AAAATCGCTG 3300
ACTTGAAGCA AGAACAACTA GCGAGTGCTC AAAAAGAAAT CAATGATCAG CAACAATCCT 3360
TAGATGAGTT AAAAGCCAAG CAATCGTGTG CTAATGAGGA TTATCAAGAT ACGGTTATTA 3420
AAGGACCAGA AGATGGCATT TTACATTTAG CCACTGACAA AACTAAAATC AAGTATTTCC 3480
CTAAAGGCAC AACCATTGCG CAAATTTATG CTAAACTGAC GCAAAAAACA GCTTTGAATG 3540
TTGAGTACTA TGTGCCTGCC AGTAATATTA TCGGCTTAAA GCAAAGACAA GCCATCCGTT 3600
TTGTAGCAAA TCAAAATGTC ACGAAACCGC TCACCTTAAA CGGAACAATC AAAAGCATTA 3660
GTTCTGCACC AATAGCCAGT AAAGAGGGAT CCTTTTATAA ATTAGTCGCG ACGATTCAGG 3720
CTAGCAAAAT AGACCGTGAA CAGATTAAAT ATGGTCTTAA TGGTCGAATC ACAACCATAA 3780
AAGGGACTAA AACATGGTTT AATTATTATA AAGACATTGT TTTAGGTGAG AATAATTAGC 3840
TAGGAAGATA AACACAATTT TTAAACGTGT TTATCTTTTT TAGTCTCAAT GAAATTGTCG 3900
CCGAAGGTTT TTCTAGCCAA GTGGCAGGAC ACAGAAAAAT GATAGTTGCT ACTGAAGGGA 3960
AGTTCAACTG CCACCAAAAA TAGTAACCGC GCGACAGCCA ACCGCCACCA CAACAGTTAT 4020
GCTCGCCCGT GGTTATTATT ATCATTAACA CTCTTACGTC TTTCTATGAT ACTTTTGAGC 4080
CACATTCTTA TAATGCTGCA ATCGACCTTT TAGAAAATTG ATCTCATCAG AAATTTCTTT 4140
TAAGTGGTTA TCATCAGCAT GTTTACTAGC AATATTTAAT TCTTTAATCC TACGTTTAAT 4200
CAACTTAGTA GTTTTAGTAT CTTTCATGTA TTGATTATCT CAAAAAAACA CCCAACAAGG 4260
GCAATCAGTT TGATTTGAGC AGAGGAAGCC 4290
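
The nucleotide sequences in this listing are laid out in groups of ten bases, six groups per row, with the running base count printed at the end of each row (540, 600, and so on up to 4290 for SEQ ID NO: 3). The sketch below reproduces that layout so that a cleaned-up sequence can be checked against the printed counts. It is a minimal illustration only: the function name `format_listing` and its parameters are hypothetical and are not part of PatentIn.

```python
def format_listing(seq, group=10, per_line=6, offset=0):
    """Lay out a nucleotide string in blocks of `group` bases, `per_line`
    blocks per row, ending each row with the cumulative base count.

    `offset` is the number of bases that precede `seq` in the full record,
    so an excerpt can be numbered as it is in the listing.
    """
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    width = group * per_line
    rows = []
    for start in range(0, len(bases), width):
        chunk = bases[start:start + width]
        blocks = [chunk[i:i + group] for i in range(0, len(chunk), group)]
        rows.append(f"{' '.join(blocks)} {offset + start + len(chunk)}")
    return rows


# Last two rows of SEQ ID NO: 3 (bases 4201 to 4290), as printed in the listing.
tail = (
    "CAACTTAGTA GTTTTAGTAT CTTTCATGTA TTGATTATCT CAAAAAAACA CCCAACAAGG "
    "GCAATCAGTT TGATTTGAGC AGAGGAAGCC"
)
rows = format_listing(tail, offset=4200)
for row in rows:
    print(row)
```

A row whose final count does not advance by 60 (or whose last row does not end on the stated LENGTH) points to a dropped or duplicated base in the transcription.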

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 717 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: lcaC

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Phe Gln Lys Arg Ile Asn Tyr Ile Ala Gln Val Asp Glu Arg Asp
1               5                   10                  15

Cys Gly Val Ala Ala Leu Ala Met Val Leu Thr His Tyr Lys Thr Arg
            20                  25                  30

Leu Ser Leu Ala Lys Leu Arg Asp Leu Ala Lys Thr Asp Met Glu Gly
        35                  40                  45

Thr Thr Ala Leu Gly Ile Val Lys Ala Ala Asn Ala Leu Asp Phe Glu
    50                  55                  60

Thr Met Pro Ile Gln Ala Asp Leu Ser Leu Phe Asp Lys Lys Asp Leu
65                  70                  75                  80

Pro Tyr Pro Phe Ile Ala His Val Ile Lys Asp Gly Lys Tyr Pro His
                85                  90                  95

Tyr Tyr Val Val Tyr Gly Ile Lys Gly Asp Gln Leu Leu Ile Ala Asp
            100                 105                 110

Pro Asp Asn Thr Val Gly Lys Asn Lys Met Thr Lys Ala His Phe Asn
        115                 120                 125

Glu Glu Trp Thr Gly Val Ser Ile Phe Ile Ala Pro Asn Pro Thr Tyr
    130                 135                 140

Lys Pro Thr Lys Asp Lys Lys Arg Ser Leu Thr Ser Phe Ile Pro Val
145                 150                 155                 160

Ile Thr Arg Gln Lys Leu Leu Val Ile Asn Ile Val Ile Ala Ala Leu
                165                 170                 175

Leu Val Thr Leu Val Ser Ile Leu Gly Ser Tyr Tyr Leu Gln Gly Ile
            180                 185                 190

Ile Asp Thr Tyr Ile Pro Asp Asn Met Lys Asn Thr Leu Gly Ile Val
        195                 200                 205

Ser Leu Gly Leu Ile Phe Ala Tyr Val Ile Gln Gln Leu Leu Ser Tyr
    210                 215                 220

Ala Arg Asp Tyr Leu Leu Ile Val Met Gly Gln Arg Leu Ser Ile Asp
225                 230                 235                 240

Ile Ile Leu Ser Tyr Ile Lys His Ile Phe Glu Leu Pro Met Ser Phe
                245                 250                 255

Phe Ala Thr Arg Arg Thr Gly Glu Ile Val Ser Arg Phe Thr Asp Ala
            260                 265                 270

Asn Ala Ile Ile Glu Ala Leu Ala Ser Thr Met Leu Ser Val Phe Leu
        275                 280                 285

Asp Leu Gly Ile Leu Val Ile Val Gly Thr Val Leu Val Val Gln Asn
    290                 295                 300

Ser Thr Leu Phe Leu Ile Ser Leu Ile Ala Ile Pro Ala Tyr Ala Leu
305                 310                 315                 320

Val Val Trp Leu Phe Met Arg Pro Phe Ser Lys Met Asn Asn Asp Gln
                325                 330                 335

Met Gln Ala Gly Ser Met Leu Ser Ser Ser Ile Ile Glu Asp Ile Asn
            340                 345                 350

Gly Val Glu Thr Ile Lys Ala Leu Asn Ser Glu Glu Thr Ala Tyr His
        355                 360                 365

Lys Ile Asp His Glu Phe Val Thr Tyr Leu Glu Lys Ser Phe Val Tyr
    370                 375                 380

Ala Lys Thr Glu Ala Thr Gln Asn Ala Ile Lys Ser Leu Leu Gln Leu
385                 390                 395                 400

Ser Leu Asn Val Val Ile Leu Trp Val Gly Ala Gln Leu Val Met Thr
                405                 410                 415

Asn Lys Ile Ser Val Gly Gln Leu Ile Thr Tyr Asn Ala Leu Leu Gly
            420                 425                 430

Phe Phe Thr Asp Pro Leu Gln Asn Ile Ile Asn Leu Gln Thr Lys Leu
        435                 440                 445

Gln Gln Ala Ser Val Ala Asn Asn Arg Leu Asn Glu Val Tyr Leu Val
    450                 455                 460

Asp Ser Glu Phe Lys Ala Ser His Gln Met Thr Glu Ser Ile Met Pro
465                 470                 475                 480

Asn Ser Ser Leu Val Ala Asp His Ile Thr Tyr Lys Tyr Gly Phe Gly
                485                 490                 495

Ala Pro Ala Ile Asp Asp Val Ser Leu Thr Ile Thr Ala Gly Glu Lys
            500                 505                 510

Ile Ala Leu Val Gly Ile Ser Gly Ser Gly Lys Ser Thr Leu Val Lys
        515                 520                 525

Leu Leu Val Asn Phe Phe Gln Pro Glu Ser Gly Thr Ile Ser Leu Gly
    530                 535                 540

Gln Thr Pro Leu Ala Asn Leu Asp Lys His Glu Leu Arg Ala His Ile
545                 550                 555                 560

Asn Tyr Leu Pro Gln Glu Pro Phe Ile Phe Ser Gly Ser Ile Met Asp
                565                 570                 575

Asn Leu Leu Leu Gly Ala Lys Pro Gly Thr Thr Gln Glu Asp Ile Ile
            580                 585                 590

Arg Ala Val Glu Ile Ala Glu Ile Lys Asp Asp Ile Glu Lys Met Ser
        595                 600                 605

Gln Gly Phe Gly Thr Glu Leu Ala Glu Ser Gly Asn Ile Ser Gly Gly
    610                 615                 620

Gln Lys Gln Arg Ile Ala Leu Ala Arg Ala Ile Leu Val Asp Ser Pro
625                 630                 635                 640

Val Leu Ile Leu Asp Glu Ser Thr Ser Asn Leu Asp Val Leu Thr Glu
                645                 650                 655

Lys Lys Ile Ile Asp Asn Leu Met Gln Leu Thr Glu Lys Thr Ile Ile
            660                 665                 670

Phe Val Ala His Arg Leu Thr Ile Ser Gln Arg Val Asp Arg Ile Leu
        675                 680                 685

Thr Met Gln Asn Gly Lys Ile Ile Glu Asp Gly Thr His Asn Thr Leu
    690                 695                 700

Leu Asn Ala Gly Gly Phe Tyr Ala Ser Leu Phe Asn His
705                 710                 715

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 457 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: lcaD

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

35 



Ph e *s P Pro L,= Tyr ^ GX» Ser 01, SI. «- Tyr Oln ^ « g 

5 

^ Ton Tie He Val Pro He Phe Leu Leu Val 
n Phe Pro Thr Leu He lie vai 3Q 



40 



Tyr Arg Asn Phe fro ui — - 
3 20 " 



V,X P,« XI. .U - «- Ser ,e, «- «. Ly. « ttu U. ™ V.1 

45 Lys Ma 'sir sly «iu xi. xi- «■> «■ ^ ™ SS" S " " P " e 

50 D 

rin leu Thr Glu Asn Lys 
Ser Thr Ser Asn Asn Ala He Asp Ser Asn Gin 

65 

50 „ a X Val L,. f/ ,SP - Val TKr P„e Th r Ser Cly ^ Gl» 

^ Ue ser Ser 1 U. ^ - «» «» - M " "* *" "* 
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100 

Mg M g. s,r Xe„ -P - Tg xys - - XX. vg Mn ox, - 

set Glu £ oly CXy T h r f P «m «» -y Ty. »JP - - «- - 

130 

C g Tyr Met Ala 6ta Val A SP - Leu Thr Ser Glu - - Gin Gin 

rln Thr Ala Asp Gin Gin Ala Asn His Gin lie Asp 
Ser Ser Asp Lys Gin Thr Aia p 175 
165 

VaX ,s CX„ Oly OXn Ser xya »« - OXn OXn X« - »» Tyr 

OXn R Xa XX. Z - - XXe J.* Ser - - - "» 

nor, Tvr Ser Ala Gin Leu Lys Ser Ala 
Pro Tyr Gin Ala He Tyr Asp Asn Tyr Ser Al ^ 

210 

G1 „ «p »p g- *■> L - s; Tht Rli Leu sel 5s 

" OX. OXn OXn XX. - - - - So ^ S " 

S.r OXn IX. AXa Txy XXe Tnr Xys Ser OXy Pro Xe» Ser OXn Ser Ser 

2 60 

Th r Xe U Asp Xys XXe Ala <UP - Xy S OXn OX, OXn Xeu Ala Ser Ala 

OXn Xy S oil XX. Aen »P g» OXn Oln Ser Xe» Asp OX» Xeu Xys AXa 

290 

' Lys OXn ser ser AXa Asn OXu Asp Xyr Oln Asp Thr val XXe Xys Ma 

305 

. t Ma Thr- asd Lys Thr Lys He Lys 
Pro Glu Asp Gly lie Leu His Leu Ala Thr Asp Ly ^ 

325 

_ rlv Thr Thr He Ala Gin He Tyr Pro Lys Leu Thr 
Tyr Phe Pro Lys Gly Thr inr x 350 
1 340 

* vaI Glu Tvr Tyr Val Pro Ala Ser Asn He 
Gin Lys Thr Ala Leu Asn Val Glu Tyr ly ^ 

355 JD 



, rm Ala lie Arg Phe Val Ala Asn Gin Asn 
He Gly Leu Lys Gin Arg Gin Ala He Arg ^ 

370 

r1u Thr ne Lys Ser He Ser Ser 
^ T o^o leu Thr Leu Asn Gly Thr lie 
Val Thr Lys Pro Leu 






395 

390 



PATENT 
Atty Docket 07254/051001 

400 



10 



_ s Glu Gly Ser Phe Tyr Lys Leu Val Ala Thr 
Ala Pro lie Ala Ser Lys Glu Gly ^ 

^ rm lie Lys Tyr Gly Leu Asn 
c tvs He Asp Arg Glu Gin lie by 

He Gin Ala Ser Lys ixe v ^ 

420 , T hr Tro Phe Asn Tyr Tyr 

Gly « II. - * Ly. 01, «« W s Thr Trp a5 

ly . MP U. «1 - °* =U M " 

450 

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: N-terminal

(vii) IMMEDIATE SOURCE:
(B) CLONE: Divergicin N-terminal

(ix) FEATURE:
(A) NAME/KEY: Cleavage-site
(B) LOCATION: 29..30

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Lys Lys Gln Ile Leu Lys Gly Leu Val Ile Val Val Cys Leu Ser
1               5                   10                  15

Gly Ala Thr Phe Phe Ser Thr Pro Gln Gln Ala Ser Ala Ala Ala Pro
            20                  25                  30

Lys Ile Thr
        35

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: N-terminal

(vii) IMMEDIATE SOURCE:
(B) CLONE: Divergicin N-terminal, cleaved

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Lys Lys Gln Ile Leu Lys Gly Leu Val Ile Val Val Cys Leu Ser
1               5                   10                  15

Gly Ala Thr Phe Phe Ser Thr Pro Gln Gln Ala Ser Ala
            20                  25
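
SEQ ID NO: 6 and SEQ ID NO: 7 are related through the Cleavage-site feature at 29..30: the cleaved leader is residues 1 to 29 of the N-terminal fragment. The sketch below shows that split, assuming the location names the two residues on either side of the cleaved bond (1-based). The names `THREE_TO_ONE`, `to_one_letter` and `split_at_cleavage` are illustrative, and the 35-residue input is taken from the first 35 residues of SEQ ID NO: 16.

```python
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter):
    """Convert a space-separated three-letter sequence to one-letter codes."""
    return "".join(THREE_TO_ONE[res] for res in three_letter.split())


def split_at_cleavage(peptide, location):
    """Split `peptide` at a Cleavage-site location such as '29..30'.

    The location is read as the pair of 1-based residue numbers that flank
    the cleaved bond, so the two numbers must be consecutive.
    """
    left, right = (int(part) for part in location.split(".."))
    if right != left + 1:
        raise ValueError("a cleavage site must span two adjacent residues")
    return peptide[:left], peptide[left:]


n_terminal = to_one_letter(
    "Met Lys Lys Gln Ile Leu Lys Gly Leu Val Ile Val Val Cys Leu Ser "
    "Gly Ala Thr Phe Phe Ser Thr Pro Gln Gln Ala Ser Ala Ala Ala Pro "
    "Lys Ile Thr"
)
leader, remainder = split_at_cleavage(n_terminal, "29..30")
print(leader)      # the 29-residue leader, as in SEQ ID NO: 7
print(remainder)   # the residues that follow the cleavage site
```

The same helper applies to the other N-terminal/cleaved pairs in the listing (for example 24..25 for leucocin A and 15..16 for colicin V).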

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: N-terminal

(vii) IMMEDIATE SOURCE:
(B) CLONE: leucocin A N-terminal

(ix) FEATURE:
(A) NAME/KEY: Cleavage-site
(B) LOCATION: 24..25

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Met Asn Met Lys Pro Thr Glu Ser Tyr Glu Gln Leu Asp Asn Ser
1               5                   10                  15

Ala Leu Glu Gln Val Val Gly Gly Lys Tyr Tyr Gly Asn Gly
            20                  25                  30

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: leucocin A, cleaved 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Met Asn Met Lys Pro Thr Glu Ser Tyr Glu Gln Leu Asp Asn Ser
1               5                   10                  15

Ala Leu Glu Gln Val Val Gly Gly
            20

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 



(vii) IMMEDIATE SOURCE: t 

(B) CLONE: lactococcin A N-terminal 



(ix) FEATURE: 

(A) NAME/KEY: Cleavage-site
(B) LOCATION: 21..22

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Lys Asi> Gin Leu MJ Ph* " n 10 15 

L Ma Asn Gly CI, I*. - Tht g* "* G1 " 

20 

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: N-terminal

(vii) IMMEDIATE SOURCE:
(B) CLONE: lactococcin A, cleaved

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Lys Asn Gln Leu Asn Phe Asn Ile Val
1 5 

■ Glu Ala Asn Gly Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(v) FRAGMENT TYPE: N-terminal

(vii) IMMEDIATE SOURCE:
(B) CLONE: colicin V N-terminal

(ix) FEATURE:
(A) NAME/KEY: Cleavage-site
(B) LOCATION: 15..16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Arg Thr Leu Thr Leu Asn Glu Leu Asp Ser Val Ser Gly Gly Ala
1               5                   10                  15

Ser Gly Arg Asp Ile
            20



(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: colicin V, cleaved 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Arg Thr Leu Thr Leu Asn Glu Leu Asp Ser Val Ser Gly Gly 
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3475 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: pCD3.4

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

GATATCTTGG TATTACAAAC TAATTGGAGG TTGGTATATA TGAAAAAACA AATTTTAAAA 60
GGGTTGGTTA TAGTTGTTTG TTTATCTGGG GCAACATTTT TCTCAACACC ACAACAAGCT 120
TCTGCTGCTG CACCGAAAAT TACTCAAAAA CAAAAAAATT GTGTTAATGG ACAATTAGGT 180
GGAATGCTTG CTGGAGCTTT GGGTGGACCT GGCGGAGTTG TGTTAGGTGG TATAGGTGGT 240
GCAATAGCAG GAGGTTGTTT TAATTAAATG AAAATAAAAT GGTACTGGGA ATCTCTGATT 300
GAAACCTTAA TATTTATAAT TGTTCTTCTT GTATTTTTTT ATAGAAGTTC TGGTTTTTCT 360
TTAAAAAATT TAGTTTTAGG AAGTTTATTT TATTTGATAG CAATTGGTCT TTTTAATTAT 420
AAAAAGATAA ACAAATAGGC ACTATTTTTA AATTTACAAC TTTTGCATTT TAAGTATATT 480
GTTGTTATTA TTAAGGTGCG AGATGAGATA AGGTCTACAT GGACAGCACA AAACCCACCC 540
CTAATGCGAA TAGGGGTGGG TTTTTTTCGT TCGTTGCGAA TACGAACGTG TGGGTTAGAG 600
ACAACTTGCG AGATTATCGT CTAATCATCT AACCAATGAT CCACTAGTAT TAATACTAGT 660
CCCACAAAAA GTGGAGCAAT AACCAATGAG ATAAGGTTTT CCATAAACAG CACCCCCTTT 720
CAGGGGCAAG TTGCCACTTA CTAATATAGC ACAGCTCCTT TATTGTTCTT AGTCTAAATC 780
TGATAAATCT TTTCTTGTTC AAAAATATAG ACCACTTAAA AGCTTATAAC GGTACTAGAT 840
TTTTCAGATA CCCCAATTAC CTACTTAAAA CGTCTCTCTT TTTCGTTTTA AGATGTTTAA 900
AATTATTTTC TATGAATTAT ACACAAATGT GCTTAAATCG TCTTAAATCG TCTTAAAATG 960
TGGTCTGTGT TGAGAATACA ACGACTTTGT TTGGTCGTAC CTCTAAATCT GTTTGCTGTG 1020
AACGAGGGTA GCGAAGTGAA CTTTTTGTTG CTAACGCTCT TGGTTTTGTC TTTTGATTTT 1080
ATAAAATGTG GATGTAATCC ACTCCTTACT AGGGGTTTAA TCTTTATAAA ATAAAGGAGC 1140
TTGCGAATGC AAGGTGCCCT TTTTTCTTTG TCTGACTACT AGGGACAAAT TATCTGAGTA 1200
TGAACAAGAT TTTGTCTGTT CTTGCGCGTA TTTATTAATA TATATTTTAA GAGATATTTT 1260
AAGAGATATT TTAAAACCTT TTTAGGGGTG AGCTCAGCCT TAGAGAGAGT AAGCATTGAA 1320



GCATAGTACT AGGGACAAAT TATCTGACTA CTAGGGATAA ATTATCTGAC TACTAGGGAC 1380
AAATTATCTG ACTACTAGGG ATAAATTATC TGACTACTAG GGACGCACTT TACTTTGTGT 1440
ATCGTATCGT TTATAATCTT TATATGTGAG GGGAGGTCGA AAGGATTGGA AAAGAAAACG 1500
AATTTAAAAA TTGCATATCA AAATGAATTG AATCTGGTTC CACTTAAAAA TTTCAATGCT 1560
AAACAAATGG ATTTATTCTT TGCTTTGTGT GCCCGAATGA AAAATAAAGG GCTTAGAAAG 1620
GTATCTTTTA CGTTTGAAGA ACTAAGAGAA CTAAGTGATT ACAAGATAAC TGCTATCGAA 1680
CCGTTCACGA ATGATTTAGA ACAATTATAC AAAAAAATGT TAAACCTAAC ATACAGAACG 1740
GAGACAGAAA CAAAAATCAG TTATTTCGTT TTATTTACTG GGTTTGTGAT TGATAAATCA 1800
GAGCAAATTG TTGAAGTTAG TGTAAACCCA GACTTAGAAC ATATCATTAA CGGTATCTCT 1860
AGTGAGTTCA ATAAATTTGA GTTACTAGCA TTCACAAGTA TCCAGTCGAA ATATACGAAA 1920
ACACTCTTTA GATTGCTTAT GCAGTTTCAA TCAACTGGGT TTTATGTGGT TAAAATTGAA 1980
GATTTCAGAG CACTTTTAGA CATTCCAAAA TCTTATCAAA TGACTGACAT AACCCAACGG 2040
ATATTGAAAC CTAGTTTAAT TGAGTTAAGT CAGTACTTTA ATGATTTAAA AGTTAATAAA 2100
ATTAAAGCTC GAAAGGGTAA TAAAATAGAC CGTTTAGAAT TCACTTTCTC CGGTCTAAAG 2160
ACTGATTTAC CTAAAGTTCC ATTGCACGAC TGGACGAAAT AAAAAAAGGA CCTCCCCCTC 2220
ACATTTAAGC AAGTAGGAAC GTCCCTCGCA ATCCACGAAG ACTGCTGATT CATTTTAGCA 2280
TATATTGTGC GGGACTTCTA AATAAATTAT ATTTGGAGGT CATTTTTATG TCGAATAAGT 2340
ACTTGAAAAA AAGAAAGCGT CAAGCTAAGC AGGTAGCTGA TTTGTACGAT TTAATTATTG 2400
GGGTTGAACA TGCTGGCAGC TCGTTAATTG CGTTGTATGA GGGAATTAAA CCCTCTCAAT 2460
ATCGAATTTT TATTCTTTTG TCTTATTCTA GTTTTGAAAA TAAATTAAAC TTATACAACA 2520
AAGCGATTTT AAGAACTGAA GTTTATTCTT TAGAAAAAAA ATTAAACGAA AAAATAAATG 2580
CTCAAATCAG AATTGCGCAA AAAAATAAAA AAGAAATTGC GGTAATTGAT TTCACAAAAC 2640
AAAAAGAAAA ACTCAAAAGA GAATTACTTA GTTTTGAAAA TGATAAAGAA ATGAAACTTA 2700
TGGATTCGCA ATTAAAACAA TTTCATGAAA ATAAAACGTT AGCTGATATT AATGATCAGT 2760
TTTTTATGAC GGTACAAAAT AGTTTAATTT TGCTGCATAA AAAAGCACCT TTAACATTAA 2820
AATTAATTTG TTTGAAAAAT TATATTCGCC TTTGCAAAAA TTATTTTCTA AAGAATATAT 2880
TTTAATGTTT TTTGAAAAAA ATAGTAACAT GGGAACATGT TGCTCTGCTC GCAAAAGGAA 2940
AAATATTTAA ACTAATAAAA AACCGTCGGA GACCAGCCAA CCAATAGGTT GGCTTTAAGT 3000
TTAAGCCTAC GTTGACAACT GTCAATGTAT AAGTGCGCCC TTTGGGTGTT TTATTTTTTG 3060
TTTAACTATT ATTTTCTGCA TAGGTTTTTT ATTTTTATTA ATTTGATTTT CAAGAAAGGG 3120
ATGAACCTAA AATGATTTAT AAACAAAAAA AGAAAGAAGA TGTTTTTGGA TTTCCTAAAG 3180
TTTTAACAAT TGCTGATTTG AGTACGAGAT GGAAAATGTC ACGTCAAGCT ATCCATAAAA 3240
AAATTCAAGA AGATTTATTA TTTCCTATGC CTGTTCAAAT TGTCTCAAAT GGAAAAATTA 3300
AATTGTTTTT ATTTGTTGAT ATTGAAAAAT ACGAAAAAAA TCGTCCGTGG TTATTAGACA 3360
TTAATTATCG AAATGAACGA CAACTTTGGA TTTACAAAAA TGGTTTTTTT AAATAGCAAG 3420
TTAGTCAATT ACCTTATACC TTGTTGGATA TCTTTGGATA AAAAAATAGT TGTAT 3475

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 228 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: Divergicin structural gene

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..228

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

ATG AAA AAA CAA ATT TTA AAA GGG TTG GTT ATA GTT GTT TGT TTA TCT 48
Met Lys Lys Gln Ile Leu Lys Gly Leu Val Ile Val Val Cys Leu Ser
1               5                   10                  15

GGG GCA ACA TTT TTC TCA ACA CCA CAA CAA GCT TCT GCT GCT GCA CCG 96
Gly Ala Thr Phe Phe Ser Thr Pro Gln Gln Ala Ser Ala Ala Ala Pro
            20                  25                  30

AAA ATT ACT CAA AAA CAA AAA AAT TGT GTT AAT GGA CAA TTA GGT GGA 144
Lys Ile Thr Gln Lys Gln Lys Asn Cys Val Asn Gly Gln Leu Gly Gly
        35                  40                  45

ATG CTT GCT GGA GCT TTG GGT GGA CCT GGC GGA GTT GTG TTA GGT GGT 192
Met Leu Ala Gly Ala Leu Gly Gly Pro Gly Gly Val Val Leu Gly Gly
    50                  55                  60

ATA GGT GGT GCA ATA GCA GGA GGT TGT TTT AAT TAA 228
Ile Gly Gly Ala Ile Ala Gly Gly Cys Phe Asn
65                  70                  75
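
The CDS feature of SEQ ID NO: 15 (1..228) and the 75-residue protein printed beneath it can be cross-checked by translating the nucleotide sequence. The sketch below does this under the assumption that the standard genetic code applies (the listing does not name a translation table). The nucleotide string is the CDS as read from this listing, with OCR damage repaired against the matching region of SEQ ID NO: 14; `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: translation ends here
            break
        protein.append(residue)
    return "".join(protein)


SEQ15_CDS = """
ATG AAA AAA CAA ATT TTA AAA GGG TTG GTT ATA GTT GTT TGT TTA TCT
GGG GCA ACA TTT TTC TCA ACA CCA CAA CAA GCT TCT GCT GCT GCA CCG
AAA ATT ACT CAA AAA CAA AAA AAT TGT GTT AAT GGA CAA TTA GGT GGA
ATG CTT GCT GGA GCT TTG GGT GGA CCT GGC GGA GTT GTG TTA GGT GGT
ATA GGT GGT GCA ATA GCA GGA GGT TGT TTT AAT TAA
"""

protein = translate(SEQ15_CDS)
print(len(protein), protein)
```

The 228 bases make 76 codons; the last is a stop codon, which is why the protein of SEQ ID NO: 16 is listed as 75 amino acids.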

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 75 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Lys Lys Gln Ile Leu Lys Gly Leu Val Ile Val Val Cys Leu Ser
1               5                   10                  15

Gly Ala Thr Phe Phe Ser Thr Pro Gln Gln Ala Ser Ala Ala Ala Pro
            20                  25                  30

Lys Ile Thr Gln Lys Gln Lys Asn Cys Val Asn Gly Gln Leu Gly Gly
        35                  40                  45

Met Leu Ala Gly Ala Leu Gly Gly Pro Gly Gly Val Val Leu Gly Gly
    50                  55                  60

Ile Gly Gly Ala Ile Ala Gly Gly Cys Phe Asn
65                  70                  75

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 171 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: Divergicin immunity genes

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..171

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

ATG AAA ATA AAA TGG TAC TGG GAA TCT CTG ATT GAA ACC TTA ATA TTT 48
Met Lys Ile Lys Trp Tyr Trp Glu Ser Leu Ile Glu Thr Leu Ile Phe
1               5                   10                  15

ATA ATT GTT CTT CTT GTA TTT TTT TAT AGA AGT TCT GGT TTT TCT TTA 96
Ile Ile Val Leu Leu Val Phe Phe Tyr Arg Ser Ser Gly Phe Ser Leu
            20                  25                  30

AAA AAT TTA GTT TTA GGA AGT TTA TTT TAT TTG ATA GCA ATT GGT CTT 144
Lys Asn Leu Val Leu Gly Ser Leu Phe Tyr Leu Ile Ala Ile Gly Leu
        35                  40                  45

TTT AAT TAT AAA AAG ATA AAC AAA TAG 171
Phe Asn Tyr Lys Lys Ile Asn Lys
    50                  55

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 56 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Lys Ile Lys Trp Tyr Trp Glu Ser Leu Ile Glu Thr Leu Ile Phe
1               5                   10                  15

Ile Ile Val Leu Leu Val Phe Phe Tyr Arg Ser Ser Gly Phe Ser Leu
            20                  25                  30

Lys Asn Leu Val Leu Gly Ser Leu Phe Tyr Leu Ile Ala Ile Gly Leu
        35                  40                  45

Phe Asn Tyr Lys Lys Ile Asn Lys
    50                  55

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 124 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: Divergicin signal sequence

(ix) FEATURE:
(A) NAME/KEY: sig_peptide
(B) LOCATION: 1..123

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..123

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

ATC TTG GTA TCA CAA ACT AAT TTG GAG GTT GGT ATA TAT GAA AAA ACA 48
Ile Leu Val Ser Gln Thr Asn Leu Glu Val Gly Ile Tyr Glu Lys Thr
1               5                   10                  15

AAT TTT AAA AGG GTT GGT TAT AGT TGT TTG TTT ATC TGG GGC AAC ATT 96
Asn Phe Lys Arg Val Gly Tyr Ser Cys Leu Phe Ile Trp Gly Asn Ile
            20                  25                  30

TTT CTC AAC ACC ACA ACA AGC TTC TGC T 124
Phe Leu Asn Thr Thr Thr Ser Phe Cys
        35                  40



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 41 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Ile Leu Val Ser Gln Thr Asn Leu Glu Val Gly Ile Tyr Glu Lys Thr
1               5                   10                  15

Asn Phe Lys Arg Val Gly Tyr Ser Cys Leu Phe Ile Trp Gly Asn Ile
            20                  25                  30

Phe Leu Asn Thr Thr Thr Ser Phe Cys
        35                  40
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: Brochocin-C

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

GCTATTTTGA GAAATATTAA CCAATAGTAA AAATTATCAT GCTATCTTTT GTATGTAATA 60
AAAATTATTT AAAGGAGGGT GTTTCATCAT GCACAAGGTA AAAAAATTAA ACAATCAAGA 120
GTTACAACAG ATCGTGGGAG GTTACAGTTC AAAAGATTGT CTAAAAGATA TTGGTAAAGG 180
AATTGGTGCT GGTACAGTAG CTGGGGCAGC CGGCGGTGGC CTAGCTGCAG GATTAGGTGC 240
TATCCCAGGA GCATTCGTTG GAGCACATTT TGGAGTAATC GGCGGATCTG CCGCATGCAT 300
TGGTGGATTA TTAGGTAACT AGGAGGTTAT ATTTATGAAA AAAGAACTAT TGAATAAAAA 360
TGAAATGAGT AGAATTATCG GCGGCAAAAT AAATTGGGGA AATGTTGGCG GTTCTTGTGT 420
TGGAGGTGCA GTAATTGGAG GCGCCCTCGG TGGACTAGGT GGAGCTGGCG GAGGTTGCAT 480
TACAGGAGCT ATCGGAAGTA TTTGGGATCA ATGGTAAAAA CTATACTATT TTCGGTTGTA 540
ATTTCATTCG TTGCATTATG TAACTTTTTA ATAAAAAAAG ATGTGTCTTC AAAAAAAAAA 600
TTATTTTTAA CAGGTTCTAT TGCTGTCTTT CTAATTATCT ATGATTTTCT ATGGATTATA 660
TTCTCTAACT AGTAC 675

(2) INFORMATION FOR SEQ ID NO: 22:



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 234 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: Brochocin-C peptide A

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..234

(ix) FEATURE:
(A) NAME/KEY: sig_peptide
(B) LOCATION: 1..54

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

ATG CAC AAG GTA AAA AAA TTA AAC AAT CAA GAG TTA CAA CAG ATC GTG 48
Met His Lys Val Lys Lys Leu Asn Asn Gln Glu Leu Gln Gln Ile Val
1               5                   10                  15

GGA GGT TAC AGT TCA AAA GAT TGT CTA AAA GAT ATT GGT AAA GGA ATT 96
Gly Gly Tyr Ser Ser Lys Asp Cys Leu Lys Asp Ile Gly Lys Gly Ile
            20                  25                  30

GGT GCT GGT ACA GTA GCT GGG GCA GCC GGC GGT GGC CTA GCT GCA GGA 144
Gly Ala Gly Thr Val Ala Gly Ala Ala Gly Gly Gly Leu Ala Ala Gly
        35                  40                  45

TTA GGT GCT ATC CCA GGA GCA TTC GTT GGA GCA CAT TTT GGA GTA ATC 192
Leu Gly Ala Ile Pro Gly Ala Phe Val Gly Ala His Phe Gly Val Ile
    50                  55                  60

GGC GGA TCT GCC GCA TGC ATT GGT GGA TTA TTA GGT AAC TAG 234
Gly Gly Ser Ala Ala Cys Ile Gly Gly Leu Leu Gly Asn
65                  70                  75
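
The sig_peptide feature of SEQ ID NO: 22 is given in nucleotide coordinates (1..54), while the protein in SEQ ID NO: 23 is numbered by residue. The sketch below converts one to the other, assuming the feature starts and ends on codon boundaries of the CDS it sits in. `feature_to_residues` and its `cds_start` parameter are illustrative names, not part of any sequence-listing tool.

```python
def feature_to_residues(location, cds_start=1):
    """Map a nucleotide feature location such as '1..54' to the 1-based
    (first, last) residue numbers it covers in the translated CDS.

    `cds_start` is the nucleotide position of the first base of the CDS.
    """
    start, end = (int(part) for part in location.replace(" ", "").split(".."))
    if (start - cds_start) % 3 != 0 or (end - cds_start + 1) % 3 != 0:
        raise ValueError("feature is not codon-aligned with the CDS")
    first = (start - cds_start) // 3 + 1
    last = (end - cds_start + 1) // 3
    return first, last


# Signal peptide of Brochocin-C peptide A (SEQ ID NO: 22).
print(feature_to_residues("1..54"))
```

For SEQ ID NO: 22 this gives residues 1 to 18, a leader that ends in Gly-Gly in the translated sequence. A location that is not codon-aligned raises an error rather than being silently rounded, which is useful for spotting transcription errors in feature coordinates.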



(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 77 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Met His Lys Val Lys Lys Leu Asn Asn Gln Glu Leu Gln Gln Ile Val
1               5                   10                  15

Gly Gly Tyr Ser Ser Lys Asp Cys Leu Lys Asp Ile Gly Lys Gly Ile
            20                  25                  30

Gly Ala Gly Thr Val Ala Gly Ala Ala Gly Gly Gly Leu Ala Ala Gly
        35                  40                  45

Leu Gly Ala Ile Pro Gly Ala Phe Val Gly Ala His Phe Gly Val Ile
    50                  55                  60

Gly Gly Ser Ala Ala Cys Ile Gly Gly Leu Leu Gly Asn
65                  70                  75



(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 183 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: Brochocin-C peptide B

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..183

(ix) FEATURE:
(A) NAME/KEY: sig_peptide

(B) LOCATION: 1..51

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

ATG AAA AAA GAA CTA TTG AAT AAA AAT GAA ATG AGT AGA ATT ATC GGC 48
Met Lys Lys Glu Leu Leu Asn Lys Asn Glu Met Ser Arg Ile Ile Gly
1 5 10 15

GGC AAA ATA AAT TGG GGA AAT GTT GGC GGT TCT TGT GTT GGA GGT GCA 96
Gly Lys Ile Asn Trp Gly Asn Val Gly Gly Ser Cys Val Gly Gly Ala
20 25 30

GTA ATT GGA GGC GCC CTC GGT GGA CTA GGT GGA GCT GGC GGA GGT TGC 144
Val Ile Gly Gly Ala Leu Gly Gly Leu Gly Gly Ala Gly Gly Gly Cys
35 40 45

ATT ACA GGA GCT ATC GGA AGT ATT TGG GAT CAA TGG TA 183
Ile Thr Gly Ala Ile Gly Ser Ile Trp Asp Gln Trp
50 55 60

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Lys Lys Glu Leu Leu Asn Lys Asn Glu Met Ser Arg Ile Ile Gly
1 5 10 15

Gly Lys Ile Asn Trp Gly Asn Val Gly Gly Ser Cys Val Gly Gly Ala
20 25 30

Val Ile Gly Gly Ala Leu Gly Gly Leu Gly Gly Ala Gly Gly Gly Cys
35 40 45

Ile Thr Gly Ala Ile Gly Ser Ile Trp Asp Gln Trp
50 55 60



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 162 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:

(B) CLONE: Brochocin-C immunity peptide

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..162

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

ATG GTA AAA ACT ATA CTA TTT TCG GTT GTA ATT TCA TTC GTT GCA TTA 48
Met Val Lys Thr Ile Leu Phe Ser Val Val Ile Ser Phe Val Ala Leu
1 5 10 15

TGT AAC TTT TTA ATA AAA AAA GAT GTG TCT TCA AAA AAA AAA TTA TTT 96
Cys Asn Phe Leu Ile Lys Lys Asp Val Ser Ser Lys Lys Lys Leu Phe
20 25 30

TTA ACA GGT TCT ATT GCT GTC TTT CTA ATT ATC TAT GAT TTT CTA TGG 144
Leu Thr Gly Ser Ile Ala Val Phe Leu Ile Ile Tyr Asp Phe Leu Trp
35 40 45

ATT ATA TTC TCT AAC TA 162
Ile Ile Phe Ser Asn
50

(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 53 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Val Lys Thr Ile Leu Phe Ser Val Val Ile Ser Phe Val Ala Leu
1 5 10 15

Cys Asn Phe Leu Ile Lys Lys Asp Val Ser Ser Lys Lys Lys Leu Phe
20 25 30

Leu Thr Gly Ser Ile Ala Val Phe Leu Ile Ile Tyr Asp Phe Leu Trp
35 40 45

Ile Ile Phe Ser Asn
50

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2226 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: Enterocin 900 operon

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

AAGCTTTACT TGATATTAGT TCTGAGTTCT GCCTGATTTA TCAGTAACAT AACTCTAGAG 60

ATAACTGCGT CGCTATCTCA AGTTCTTTTT TCTTTTCTTA CAAAATAAAT ATACTTTATT 120

TCATTTTATA AGTCAACGTT TTCATTGCTT ATAAATTAGT TTTTTTAATC ATCTCAGTAA 180

TAATTTGCTA TGTCAGTTCG ATCAATACCA TTTGCATGAA AGTACAGCTA TAAGCCAATC 240

ACACCTAACC TACTCTTAAT CGTATAATGA TTCCAGTTAG CAAGGTCTTT AATACTCATG 300

ATCCCCATCT GGTTGAGTTT TTTTCATTCG AAACCCTATG CCCCAAAAGA CCGTCATTTT 360

AGGAATATTC CATACTTTCT CAGGAACATT TTGATACGTC CATTCAGCAA TAAACCCTTC 420

ATTGTGCTTC GCTTCATTGT CTAAGGCGAG TTTGGCCAAA GGGGATTATC TCCGCGACAC 480

CTACCGTAGC AATCAATCCT AATTCTTCTT TAATACGTTC CTTGGATCAT TTGAACGAAT 540

TTTTTCCTTC TCTGAGTTCT TGTTCCTTCA GTCGTAAAAA TATTCAGTGA TCTGGTCACT 600

TTTAAAATGG GTTCATCGAT TGGATACATC AGTAGATCTT CGTCAGCCAC ATATCTTTTG 660

AAAATATTAT TTACCCGCAT ATTTCGCTTG ATATATAGGT TCATACGTGG TGGAACAACG 720

TATGATGTTT TAGGAAATAG TTGTGATAAA TCACGTGGTC TACTCACATT TGTAATATCA 780

TACCGCTTTT TTGCTTCAGG AGAAGAAGCT CTAATATCAA TCCTAAACCA GTATTGTCAG 840

CGCGACTCAT AACAACAAGT TCTGTTGTTA ATGGATCAAA ATTTCTTTCT ATACACTCGA 900

TACTCGCATA AAAAGGCTTC ATGTCGATTA GAAAATAATC ATTTACTGAT TCTTTCGAAT 960

AATCCAGCAT GAATAACACC CATTCTTTTT CACATTACAC AAACGTAAGT TAGGAAATAT 1020

AAAGAAGAAA ACTAAATAGC ACTAAACAAA CAAGACAACT CATGCTTATT CCGTATAAGA 1080

AACTACATAT TATGTTAACT AGTTATTAAA ATAACATATT TAATAAAATT AAATTGTGAT 1140

TTTATAGGTT TCAGGAATGA AAAAGCCTTA TTTCAGGAAG TTTTTAACTG TTTGCTATAG 1200

ATGTATGTCA TGATAGCATC GTAATAAAAA TACTCTAAAA GGAGCGAGTT TAAATATGCA 1260

AAATGTAAAA GAATTAAGTA CGAAAGAGAT GAAACAAATT ATCGGTGGAG AAAATGATCA 1320

CAGAATGCCT AATGAGTTAA ATAGACCTAA CAACTTATCT AAAGGTGGAG CAAAATGTGG 1380

TGCTGCAATT GCTGGGGGAT TATTTGGAAT CCCAAAAGGA CCACTAGCAT GGGCTGCTGG 1440

GTTAGCAAAT GTATACTCTA AATGCAACTA AAAAAGAAGA GAAAAAACTC ATTACGAGTT 1500

TTTTCTCTTC TTTTTTTGCA TGAAATTAGG AATAACTAAT AAAACAATAG CAATCAATAG 1560

TAAAATCTTA CTTAATATAG TTTCGGAAAA AATAAATAAT CCTAAATTTA TAATTACTGC 1620

TAAAAAAATG CATAAATTAT ACTCTAAATT ATTTTTTTTT AAATTCATAA TATAAACATC 1680

CTCTCTTTAA TTAGTCTACC ATTCCGAAAT ATTTCATCCC CAGCTCTTTT TTTACTAATA 1740

TACCAACTAC ATTTAATAAC AAAATAACTA GTAAACTTAA TATTTTTAGT GGCATAGAAT 1800

ATTCAAAAAT AAATAAAGGC ACCATACATG TAGCTATCAA TATAAATACA GAACTTACGT 1860

ATTTTATTAT TTTACGGAAC ATTATAACCT ATTACAACTC CGCAAATAGC CATAGCCCAT 1920

ACCATAGATA AGATTTTTAC CAGCACCACC ACCACATGTT TGTTTTATCT CTTTCATACT 1980

TAATTTTTTT ACATTTTGCA TGTCTCTACA TGCTCCTTTT AAAGTTTTTT TAGAACCTCA 2040

CGACTATAAC ATGGATAATT TAATCGTGGT CAAAAACTTC CTGAAATAGG GTGTTTCATA 2100

TCCTGAACAC GAATTTTTAG TCAATTTTCG AAAAATGAAA CTTTAAAATT TCTTTGACCA 2160

GAACTCTATT TATTCTTGTG TTGTTCCTTC GAATAGGTTC CCGTATATCT TTTTTATTTG 2220

AAGCTT 2226

(2) INFORMATION FOR SEQ ID NO: 29:




(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 216 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: Enterocin 900 peptide

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..216

(ix) FEATURE:
(A) NAME/KEY: sig_peptide
(B) LOCATION: 1..54

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

ATG CAA AAT GTA AAA GAA TTA AGT ACG AAA GAG ATG AAA CAA ATT ATC 48
Met Gln Asn Val Lys Glu Leu Ser Thr Lys Glu Met Lys Gln Ile Ile
1 5 10 15

GGT GGA GAA AAT GAT CAC AGA ATG CCT AAT GAG TTA AAT AGA CCT AAC 96
Gly Gly Glu Asn Asp His Arg Met Pro Asn Glu Leu Asn Arg Pro Asn
20 25 30

AAC TTA TCT AAA GGT GGA GCA AAA TGT GGT GCT GCA ATT GCT GGG GGA 144
Asn Leu Ser Lys Gly Gly Ala Lys Cys Gly Ala Ala Ile Ala Gly Gly
35 40 45

TTA TTT GGA ATC CCA AAA GGA CCA CTA GCA TGG GCT GCT GGG TTA GCA 192
Leu Phe Gly Ile Pro Lys Gly Pro Leu Ala Trp Ala Ala Gly Leu Ala
50 55 60

AAT GTA TAC TCT AAA TGC AAC TA 216
Asn Val Tyr Ser Lys Cys Asn
65 70

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Gln Asn Val Lys Glu Leu Ser Thr Lys Glu Met Lys Gln Ile Ile
1 5 10 15

Gly Gly Glu Asn Asp His Arg Met Pro Asn Glu Leu Asn Arg Pro Asn
20 25 30

Asn Leu Ser Lys Gly Gly Ala Lys Cys Gly Ala Ala Ile Ala Gly Gly
35 40 45

Leu Phe Gly Ile Pro Lys Gly Pro Leu Ala Trp Ala Ala Gly Leu Ala
50 55 60

Asn Val Tyr Ser Lys Cys Asn
65 70

(2) INFORMATION FOR SEQ ID NO: 31: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Colicin V prepeptide 

(ix) FEATURE:

(A) NAME/KEY: Disulfide-bond

(B) LOCATION: 91..102

(ix) FEATURE:

(A) NAME/KEY: Cleavage-site

(B) LOCATION: 15..16



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Arg Thr Leu Thr Leu Asn Glu Leu Asp Ser Val Ser Gly Gly Ala
1 5 10 15

Ser Gly Arg Asp Ile Ala Met Ala Ile Gly Thr Leu Ser Gly Gln Phe
20 25 30

Val Ala Gly Gly Ile Gly Ala Ala Ala Gly Gly Val Ala Gly Gly Ala
35 40 45

Ile Tyr Asp Tyr Ala Ser Thr His Lys Pro Asn Pro Ala Met Ser Pro
50 55 60

Ser Gly Leu Gly Gly Thr Ile Lys Gln Lys Pro Glu Gly Ile Pro Ser
65 70 75 80

Glu Ala Trp Asn Tyr Ala Ala Gly Arg Leu Cys Asn Trp Ser Pro Asn
85 90 95

Asn Leu Ser Asp Val Cys Leu
100

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:

(B) CLONE: Colicin V

(ix) FEATURE:

(A) NAME/KEY: Disulfide-bond

(B) LOCATION: 76..87



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ala Ser Gly Arg Asp Ile Ala Met Ala Ile Gly Thr Leu Ser Gly Gln
1 5 10 15

Phe Val Ala Gly Gly Ile Gly Ala Ala Ala Gly Gly Val Ala Gly Gly
20 25 30

Ala Ile Tyr Asp Tyr Ala Ser Thr His Lys Pro Asn Pro Ala Met Ser
35 40 45

Pro Ser Gly Leu Gly Gly Thr Ile Lys Gln Lys Pro Glu Gly Ile Pro
50 55 60

Ser Glu Ala Trp Asn Tyr Ala Ala Gly Arg Leu Cys Asn Trp Ser Pro
65 70 75 80

Asn Asn Leu Ser Asp Val Cys Leu
85

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: carnobacteriocin BM1

(ix) FEATURE:

(A) NAME/KEY: RBS

(B) LOCATION: 89..93

(ix) FEATURE:

(A) NAME/KEY: RBS

(B) LOCATION: 290..293

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 103..285

(ix) FEATURE:

(A) NAME/KEY: sig_peptide
(B) LOCATION: 103..150

(ix) FEATURE:

(A) NAME/KEY: CDS
(B) LOCATION: 303..566

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

TCGAGATACG TTTATCCATG GTTCAGGATG ATTTTATCAA CGAGTTAATT ATTTATGCTA 60

CAGTAAACTT GTTACTAAAT ACTTTATAAG GAGTGTATGT AC ATG AAA AGC GTT 114
Met Lys Ser Val
1

AAA GAA CTA AAT AAA AAA GAA ATG CAA CAA ATT AAT GGT GGA GCT ATC 162
Lys Glu Leu Asn Lys Lys Glu Met Gln Gln Ile Asn Gly Gly Ala Ile
5 10 15 20

TCT TAT GGC AAT GGT GTT TAT TGT AAC AAA GAG AAA TGT TGG GTA AAC 210
Ser Tyr Gly Asn Gly Val Tyr Cys Asn Lys Glu Lys Cys Trp Val Asn
25 30 35

AAG GCA GAA AAC AAA CAA GCT ATT ACT GGA ATA GTT ATC GGT GGA TGG 258
Lys Ala Glu Asn Lys Gln Ala Ile Thr Gly Ile Val Ile Gly Gly Trp
40 45 50

GCT TCT AGT TTA GCA GGA ATG GGA CAT TAAAGAGGTA TCTAGTT ATG ATA 308
Ala Ser Ser Leu Ala Gly Met Gly His Met Ile
55 60 1

AAA GAT GAA AAA ATA AAT AAA ATC TAT GCT TTA GTT AAG AGC GCA CTT 356
Lys Asp Glu Lys Ile Asn Lys Ile Tyr Ala Leu Val Lys Ser Ala Leu
5 10 15

GAT AAT ACG GAT GTT AAG AAT GAT AAA AAA CTT TCT TTA CTT CTT ATG 404
Asp Asn Thr Asp Val Lys Asn Asp Lys Lys Leu Ser Leu Leu Leu Met
20 25 30

AGA ATA CAA GAA ACA TCA ATT AAT GGA GAA CTA TTT TAC GAT TAT AAA 452
Arg Ile Gln Glu Thr Ser Ile Asn Gly Glu Leu Phe Tyr Asp Tyr Lys
35 40 45 50

AAA GAA TTA CAG CCA GCT ATT AGT ATG TAC TCT ATT CAA CAT AAC TTT 500
Lys Glu Leu Gln Pro Ala Ile Ser Met Tyr Ser Ile Gln His Asn Phe
55 60 65

CGG GTT CCT GAC GAT CTA GTA AAA CTG TTA GCA TTA GTT CAA ACA CCT 548
Arg Val Pro Asp Asp Leu Val Lys Leu Leu Ala Leu Val Gln Thr Pro
70 75 80

AAA GCT TGG TCA GGG TTT TAACTTTAGT TCCAGATGAG TTAAAATCCT 596
Lys Ala Trp Ser Gly Phe
85

TAAAAATAAG GAATAATGGT AAATCAGCAT TCCTTATTTT TATAGTCATC ACACTATAAC 656

TTTACTTAAA GATGTTCGA 675

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Lys Ser Val Lys Glu Leu Asn Lys Lys Glu Met Gln Gln Ile Asn
1 5 10 15

Gly Gly Ala Ile Ser Tyr Gly Asn Gly Val Tyr Cys Asn Lys Glu Lys
20 25 30

Cys Trp Val Asn Lys Ala Glu Asn Lys Gln Ala Ile Thr Gly Ile Val
35 40 45

Ile Gly Gly Trp Ala Ser Ser Leu Ala Gly Met Gly His
50 55 60



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Met Ile Lys Asp Glu Lys Ile Asn Lys Ile Tyr Ala Leu Val Lys Ser
1 5 10 15

Ala Leu Asp Asn Thr Asp Val Lys Asn Asp Lys Lys Leu Ser Leu Leu
20 25 30

Leu Met Arg Ile Gln Glu Thr Ser Ile Asn Gly Glu Leu Phe Tyr Asp
35 40 45

Tyr Lys Lys Glu Leu Gln Pro Ala Ile Ser Met Tyr Ser Ile Gln His
50 55 60

Asn Phe Arg Val Pro Asp Asp Leu Val Lys Leu Leu Ala Leu Val Gln
65 70 75 80

Thr Pro Lys Ala Trp Ser Gly Phe
85

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1907 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vii) IMMEDIATE SOURCE:
(B) CLONE: carnobacteriocin B2 operon

(ix) FEATURE:

(A) NAME/KEY: -35_signal

(B) LOCATION: 165..170

(ix) FEATURE:

(A) NAME/KEY: -10_signal

(B) LOCATION: 189..193

(ix) FEATURE:

(A) NAME/KEY: RBS

(B) LOCATION: 221..225

(ix) FEATURE:
(A) NAME/KEY: CDS

(B) LOCATION: 236..433

(ix) FEATURE:

(A) NAME/KEY: RBS

(B) LOCATION: 460..465

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 473..805

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1025..1267

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

AAGCTTTTAT AGTACAATTA TTTATGCGTG CTATGCAATA GCTATTGTAT ATACTATTTT 60

TACTATGAGA AAAGATTCTT ATGAAAATAA CAAAAATAAT CGTAAAAAAG TTATATAGCA 120

TTTATTTCAT TTATGAATTC AAATACCCTG GTTCAAGATG TATTTTCCAA AAAAATGTTC 180

AGATATGATA TAGTTTTTTT GAAATACAAA TATAAAATAA AGGAGTTTGA TTTAG ATG 238
Met
1

AAT AGC GTA AAA GAA TTA AAC GTG AAA GAA ATG AAA CAA TTA CAC GGT 286
Asn Ser Val Lys Glu Leu Asn Val Lys Glu Met Lys Gln Leu His Gly
5 10 15

GGA GTA AAT TAT GGT AAT GGT GTT TCT TGC AGT AAA ACA AAA TGT TCA 334
Gly Val Asn Tyr Gly Asn Gly Val Ser Cys Ser Lys Thr Lys Cys Ser
20 25 30

GTT AAC TGG GGA CAA GCC TTT CAA GAA AGA TAC ACA GCT GGA ATT AAC 382
Val Asn Trp Gly Gln Ala Phe Gln Glu Arg Tyr Thr Ala Gly Ile Asn
35 40 45

TCA TTT GTA AGT GGA GTC GCT TCT GGG GCA GGA TCC ATT GGT AGG AGA 430
Ser Phe Val Ser Gly Val Ala Ser Gly Ala Gly Ser Ile Gly Arg Arg
50 55 60 65

CCG TAAATATATA AATACAATAT AGAGCAAGGT GGTGATACA ATG GAT ATA AAG 484
Pro Met Asp Ile Lys
1

TCT CAA ACA TTA TAT TTG AAT CTA AGC GAG GCA TAT AAA GAC CCT GAA 532
Ser Gln Thr Leu Tyr Leu Asn Leu Ser Glu Ala Tyr Lys Asp Pro Glu
5 10 15 20

GTA AAA GCT AAT GAA TTC TTA TCA AAA TTA GTT GTA CAA TGT GCT GGG 580
Val Lys Ala Asn Glu Phe Leu Ser Lys Leu Val Val Gln Cys Ala Gly
25 30 35

AAA TTA ACA GCT TCA AAC AGT GAG AAC AGT TAT ATT GAA GTA ATA TCA 628
Lys Leu Thr Ala Ser Asn Ser Glu Asn Ser Tyr Ile Glu Val Ile Ser
40 45 50

TTP CTA TCT AGG GGT ATT TCT AGT TAT TAT TTA TCC CAT AAA CGT ATA 676
Leu Leu Ser Arg Gly Ile Ser Ser Tyr Tyr Leu Ser His Lys Arg Ile
55 60 65

ATT CCT TCA AGT ATG TTA ACT ATA TAT ACT CAA ATA CAA AAG GAT ATA 724
Ile Pro Ser Ser Met Leu Thr Ile Tyr Thr Gln Ile Gln Lys Asp Ile
70 75 80

AAA AAC GGG AAT ATT GAC ACC GAA AAA TTA AGG AAA TAT GAG ATA GCA 772
Lys Asn Gly Asn Ile Asp Thr Glu Lys Leu Arg Lys Tyr Glu Ile Ala
85 90 95 100

AAA GGA TTA ATG TCC GTT CCT TAT ATA TAT TTC TAATTTTTTC AATGATGTTA 825
Lys Gly Leu Met Ser Val Pro Tyr Ile Tyr Phe
105 110

GTTGACTTCA AAAAGATGTG AAATCGATTA GCATTTTCAA AATTAGATTA AAAATACTAT 885

CTATATAAAA TAGAACTACT GATTTAAAGT ATTTATAAGA ATATAAAGTA GCAAATAACA 945

TGATAGACAC AATTAAGGAG CGACATTTTA TGGAAAATTT GAAATGGTAT TCGGGCGGGA 1005

ACGATAGAAA AAAAAAGCA ATG GCT ACT ATT ACT GAT TTG TTA AAC GAT TTA 1057
Met Ala Thr Ile Thr Asp Leu Leu Asn Asp Leu
1 5 10

AAA ATA GAC TTA GGT AAC GAA TCT CTA CAA AAT GTC TTA GAA AAT TAT 1105
Lys Ile Asp Leu Gly Asn Glu Ser Leu Gln Asn Val Leu Glu Asn Tyr
15 20 25

CTT GAA GAA TTG GAA CAA GCA AAT GCT GCT GTT CCA ATT ATA TTA GGC 1153
Leu Glu Glu Leu Glu Gln Ala Asn Ala Ala Val Pro Ile Ile Leu Gly
30 35 40

[nucleotide row illegible in source] 1201
Arg Met Asn Ile Asp Ile Ser Thr Ala Ile Arg Lys Asp Gly Val Thr
45 50 55

TTA TCA GAA ATT CAG TCT AAA AAA TTA AAA GAG CTG ATT TCA ATA TCC 1249
Leu Ser Glu Ile Gln Ser Lys Lys Leu Lys Glu Leu Ile Ser Ile Ser
60 65 70 75

TAT ATT AAA TAT GGC TAT TAATTTAGTA TTAATAACAG TGTAGGATTG 1297
Tyr Ile Lys Tyr Gly Tyr
80

ATTCAAATTA TTTGAATCAA AATTTATATA CAAATTTTAT TTATTTTGGG TCTTTAAATA 1357

ATTTTGTGTA AGTTCAAATT ATTTAAAGAT GAGTTAAAAC TCTATCTTCG AAAAACATCA 1417

CAAAATGTGA TGAAATTTGT CCCCAATTTT GGACCTTCAT GGTCCATTTT TTCGTTACAT 1477

CCATCGTCAC TAAACAAAGC ATTTTTAGTA AGGATTCATC AGATGGGAAT ACTACCTTAG 1537

ATTTTGTTGG CTTTCACAGC TGACAATGGA GGCCTTCAAT CACATTGGCG GTATAAATAA 1597

TCCGGCGCAA ATCTGCTGAA TACTTGAAAA ATGTCGCTAA TTCTGCCCAG TTATCTATCA 1657

ATTCTTCAAT TAATTTGTCT GCGTTTATAT CCATTTTCAT TCCCCTTTTT TAATTTTTCA 1717

TTTTTTAGTT ACTTTAAACG GTTTAAAGCC TTAAGCACTT AGGCTTTAAT CTTTTTTCAC 1777

TTGATCTAAT TATTTGAACT TCAGCATTTA TCTTTTGATT TATTCTTTTA GGGAATTGAC 1837

CGAATAGGGA GATTTCCTGT GAGTAGGCGC CAACGGTGGT GGCGGTCGGA GTCAGCCGAC 1897

TCACAAGCTT 1907

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 66 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Met Asn Ser Val Lys Glu Leu Asn Val Lys Glu Met Lys Gln Leu His
1 5 10 15

Gly Gly Val Asn Tyr Gly Asn Gly Val Ser Cys Ser Lys Thr Lys Cys
20 25 30

Ser Val Asn Trp Gly Gln Ala Phe Gln Glu Arg Tyr Thr Ala Gly Ile
35 40 45

Asn Ser Phe Val Ser Gly Val Ala Ser Gly Ala Gly Ser Ile Gly Arg
50 55 60

Arg Pro
65

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 111 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

Met Asp Ile Lys Ser Gln Thr Leu Tyr Leu Asn Leu Ser Glu Ala Tyr
1 5 10 15

Lys Asp Pro Glu Val Lys Ala Asn Glu Phe Leu Ser Lys Leu Val Val
20 25 30

Gln Cys Ala Gly Lys Leu Thr Ala Ser Asn Ser Glu Asn Ser Tyr Ile
35 40 45

Glu Val Ile Ser Leu Leu Ser Arg Gly Ile Ser Ser Tyr Tyr Leu Ser
50 55 60

His Lys Arg Ile Ile Pro Ser Ser Met Leu Thr Ile Tyr Thr Gln Ile
65 70 75 80

Gln Lys Asp Ile Lys Asn Gly Asn Ile Asp Thr Glu Lys Leu Arg Lys
85 90 95

Tyr Glu Ile Ala Lys Gly Leu Met Ser Val Pro Tyr Ile Tyr Phe
100 105 110



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Ala Thr Ile Thr Asp Leu Leu Asn Asp Leu Lys Ile Asp Leu Gly
1 5 10 15

Asn Glu Ser Leu Gln Asn Val Leu Glu Asn Tyr Leu Glu Glu Leu Glu
20 25 30

Gln Ala Asn Ala Ala Val Pro Ile Ile Leu Gly Arg Met Asn Ile Asp
35 40 45

Ile Ser Thr Ala Ile Arg Lys Asp Gly Val Thr Leu Ser Glu Ile Gln
50 55 60

Ser Lys Lys Leu Lys Glu Leu Ile Ser Ile Ser Tyr Ile Lys Tyr Gly
65 70 75 80

Tyr

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: JMc7

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

CCCAAGCTTC TGCTGTAAAT TATGGTAATG GTGTT 35
(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: KLR179

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

GCGCAAGCTT CTGCTCGGAC ACCAGAAATG CCTGTT 36

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: KLR180

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

GGCCAAGCTT GCCATTAAGT CTGGTTGCTA 30
(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: MB32

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:
AATTCGAGCT CGCCCAAATC
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: MB37 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TGAGTAATTT TCGGTGCAGC ACCTCCTACG ACTTGTTCGA 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: RW58 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
TACGCGCAAG AACAGACAAA ATC
(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: MB38 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TGAGTAATTT TCGGTGCAGC TCCTCCGTTA GCTTCTGAAA 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: MB39 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
TACGAATTCG AGCTCGCCC 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vii) IMMEDIATE SOURCE:
(B) CLONE: MB42

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:
ATTTTCGGTG CAGCACCTCC AGAAACAGAA TCTAATTCAT TTAGAGTCAG AGTTCTCATA 60
ATAACTTTCC TCTTTT 76
(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: MB41

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

TGAGTAATTT TCGGTGCAGC CATAATAACT TTCCTCTTTT 40

(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:

(B) CLONE: MB43

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:
ATATCACGCC CTGAAGCACC TCCTACGACT TGTTCGA 37
(2) INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:
(B) CLONE: MB44

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:
AATTAAGCTT GGATCCTTCT GTGTGGATTG TCCAAT
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vii) IMMEDIATE SOURCE:

(B) CLONE: APO-1

(ix) FEATURE:

(A) NAME/KEY: misc_difference

(B) LOCATION: replace(11..12, "")

(D) OTHER INFORMATION: /standard_name= "(any base)"

(ix) FEATURE:

(A) NAME/KEY: misc_difference

(B) LOCATION: replace(16..17, "")

(D) OTHER INFORMATION: /standard_name= "(any base)"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

AAAGATATTG GAAAGGATTG G 21